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METHOD AND SYSTEM FOR IMPROVING USABLE LIFE OF MEMORY 
DEVICES USING VECTOR PROCESSING 

BACKGROUND OF THE INVENTION 

5 

1. Field of the Invention: 

The present invention relates to data storage 
devices, and in particular relates to improved techniques 
for using and managing non-volatile memory storage 
10 devices to enhance lifetime characteristics of such 
devices . 

2. Background of the Invention: 

Memory devices are a key component in today' s 

15 computer systems, allowing high-speed access to retrieve 
and store data. A certain type of memory device known as 
non-volatile memory maintains the correct data value 
stored in the device even when power is removed from the 
device. Because non-volatile memory devices are more 

2 0 expensive than volatile memory devices (which lose data 
when removed from a power source) , their use in a 
computer system is typically limited to situations where 
a need exists to maintain data even when power is removed 
or lost. For example, configuration settings for a 

25 computer are typically stored in non-volatile memory so 
that the system properly boots-up in a proper state when 
powered-up. These configuration settings may include the 
number and type of peripheral devices that are attached 
to a computer, the current date/time, power management 

30 settings, etc. Another use of non-volatile memory in a 
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computer system is to store the initial boot -up code that 
gets executed when the computer is first turned on. This 
initial code, sometimes called BIOS when discussing 
personal computers, provides the computer with enough 
5 programming capability to allow it to initialize a 
computer and read an operating system from a storage 
device such as a hard disk. 

Technological advances and volume production have 
driven the cost of such non-volatile storage devices down 
10 so that other types of devices, such as consumer devices, 
can now cost-effectively include and utilize non-volatile 
memory for their own advantage. Cellular telephones can 
be programmed to store phone lists, for example. Digital 
cameras store snapped photographs in non-volatile memory 
15 devices. Music devices such as MP3 players also use non- 
volatile memory to store music. Because of the reduction 
in price of these non-volatile memory devices due to 
volume production, it is now possible to envision use of 
these devices in mass data storage systems, in lieu of or 
20 in addition to disk arrays or tape library systems. 

One type of non- volatile memory device is known as 
an electrically erasable and programmable read only 
memory (EEPROM) , which is a particular type of integrated 
circuit (IC) device. Another integrated circuit non- 
25 volatile memory device is a non-volatile random access 
memory (NVRAM) , which is a combination of static RAM 
(SRAM) and EEPROM. These devices are made up of an array 
of cells which store data bits that have been programmed 
into the device. The EEPROM device is programming by 
3 0 providing particular voltages to certain portions of 
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transistors (such as gate, drain and source terminals) 
that are used in each cell. The device is read by- 
providing particular, but different, voltages to certain 
portions of transistors. It is also possible to erase 
5 the content of a cell by applying a particular voltage to 
an erase gate. Similarly, a NVRAM is written to by 
writing data into the device using conventional SRAM 
writing techniques, and the SRAM data is copied to the 
EEPROM when power is removed in order to maintain the 
10 data in the EEPROM when no power exists. 

A large number of cells are typically configured to 
form a non-volatile memory device. They may be fashioned 
in a two-dimensional array, with each cell being 
individually addressable for reading or writing. 
15 However, to reduce design complexity (and cost) , groups 
of cells share certain common control lines for erasing 
the cell. The cells that share a common erase control 
are typically called a block of cells, or a block. Block 
sizes can be any size as dictated by a given design, and 
20 may be 128 bits, 256 bits, 1024 bits, etc. All the bits 
in a block are erased at the same time during an erase 
operation using the common erase control line, and this 
is sometimes referred to as erasing *in a flash' . 
Because of this, certain non-volatile memory such as 
2 5 EEPROM that is erased and reprogrammed at the block level 
(as opposed to a byte level) has become known as flash 
memory . 

In what follows the term "physical block" refers to 
a block in the sense just described; that is, a physical 
30 block is the smallest group of cells of flash memory that 



4 

Docket NO.2003-080-SFT 

can be erased or rewritten at one time. A "logical 
block/' on the other hand, refers to the data that is 
stored in a physical block. To simplify the 
explanations, an empty physical block is regarded as a 
5 physical block that contains a logical block that has 
never been modified. 

Certain non-volatile memory devices such as EE PROM 
age over time, and thus have a limit on the number of 
times they can be re -programmed . This 'aging' is not 

10 necessarily caused by how old the device is, but rather 
by how many times the device has been written to. Each 
physical block has a finite lifetime in terms of the 
number of erase/write cycles that it can undergo. For 
present flash memory technology, this endurance limit is 

15 approximately 100,000 to 1,000,000 cycles. Because of 
this limitation, the useful life of a flash memory 
terminates as soon as one physical block has reached its 
endurance limit (unless spare blocks have been held in 
reserve) . For many consumer applications, this 

20 relatively large number is generally not an issue. 

However, when these devices are used in applications 
having a large amount of input and output operations - 
such as when used in conjunction with or in lieu of a 
disk or tape subsystem - it does become an issue. 

25 In those applications where the endurance limit does 

pose a significant restriction on the lifetime of the 
device, the lifetime of the device can be extended by 
taking steps to ensure that wear is distributed more 
evenly over all the physical blocks. To cite an extreme 

3 0 example, if only one logical block is modified frequently 
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and no steps are taken to redistribute the wear, the 
useful lifetime of the device terminates as soon as the 
physical block containing the mentioned logical block 
reaches the endurance limit. However, if the active 
5 logical block is moved from one physical block to another 
at suitable intervals, the lifetime of the memory can be 
multiplied by the number of physical blocks in the flash 
memory. This illustrates the wear- leveling method for 
memory extension. The problem is to devise a wear- 
10 leveling algorithm that can be relied upon to distribute 
wear more evenly, with acceptable overheads, under 
conditions that can be expected to arise in practice. 

One approach to wear leveling for flash EEPROM 
systems is described in US Patent No. 6,230,233. The 
15 EEPROM array is divided into two or more interchangeable 
banks of physical blocks. A memory controller provides 
for interchanging the banks when it is detected that they 
are receiving significantly uneven use. However, this 
technique is deficient in several respects (the following 
20 critique refers to individual physical blocks rather than 
banks merely for simplicity) : 

1. The choice of logical blocks to be moved in a wear- 
leveling operation is based on the cumulative wear of a 
given physical block, which depends on all the logical 

25 blocks that have resided in that physical block since the 
device began operation; the activity level of the logical 
block currently stored in the physical block is thereby 
obscured, leading to suboptimal decisions. For example, 
it would be desirable to populate the most worn physical 

30 block with the least active logical block, but the 
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logical block currently residing in the least worn 
physical block (i.e., the physical block with the 
smallest cumulative wear) is not necessarily the least 
active logical block. 
5 2. Wear-leveling operations can be counterproductive if 
they are performed too frequently: a logical block that 
is frequently modified and has been moved from a physical 
block with high wear to a physical block with low wear 
may be moved back to the physical block with high wear. 
10 In view of these deficiencies in the technique 

taught in US Patent No. 6,230,233, it would be desirable 
to provide a wear-leveling technique for non-volatile 
memory devices that could be relied upon to yield 
satisfactory results with greater consistency. 
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SUMMARY OF THE INVENTION 

The present invention is a method, system and 
apparatus for improving the useful life of non-volatile 
5 memory devices such as flash memory. The present wear- 
leveling technique advantageously improves the overall 
useful life of a flash memory device by strategically 
moving inactive logical blocks (logical blocks that have 
been infrequently modified in the recent past) to the 

10 physical blocks that have experienced the most wear (the 
greatest number of erasures) since the device began 
operation and by strategically moving active logical 
blocks to the physical blocks that have experienced the 
least wear. To track the activity of both physical 

15 blocks and logical blocks, a block descriptor is 

maintained for each physical block. One component of the 
block descriptor, denoted by n, is the total number of 
times the physical block has been erased since the device 
began operation; this component measures the cumulative 

2 0 wear on the physical block. Another component of the 

block descriptor, denoted by An, is the number of times 
the physical block has been erased since the most recent 
wear-leveling event; since logical blocks are moved only 
during wear-leveling events, this component measures the 

25 activity of the logical block that is currently stored in 
the physical block. 

The information processing required to support wear 
leveling employs two vectors that contain the same 
elements but in different orders. The elements of each 

30 vector are pointers to all the block descriptors. One 
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vector is sorted in decreasing order of the cumulative 
wear (n) , while the other vector is sorted in increasing 
order of recent activity (An) . These two vectors contain 
all the information that is required for effective wear 
5 leveling. Furthermore, this information is organized in 
such a way that the most promising opportunities for wear 
leveling can easily be identified by comparing elements 
of the two vectors as these vectors are traversed (using 
pointers into the vectors) in accordance with a suitable 
10 algorithm. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The novel features believed characteristic of 
the invention are set forth in the appended claims. The 
5 invention itself, however, as well as a preferred mode of 
use, further objectives and advantages thereof, will best 
be understood by reference to the following detailed 
description of an illustrative embodiment when read in 
conjunction with the accompanying drawings, wherein: 

10 

Figure 1 depicts a system including a memory controller 
and one or more non-volatile memory devices. 

Figure 2 is a top level diagram of wear level processing 
15 for non-volatile memory devices. 

Figure 3 shows more details of the process flow for 
performing wear leveling. It is an expansion of the 
"perform wear leveling' 7 step 208 in Figure 2. 

20 

Figure 4 depicts a representative block descriptor table 
containing a plurality of block descriptors. 

Figure 5 shows one possible expansion of the "process 
25 vectors" step 308 in Figure 3. 

Figure 6 shows another possible expansion of the "process 
vectors" step 308 in Figure 3. 
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Figure 7 shows an expansion of the "process pair of 
blocks" step 608 in Figure 6. 

Figure 8 is an alternate expansion of the "process pair 
of blocks" step 608 in Figure 6 that can be used in place 
of the version shown in Figure 7 . 

Figure 9 is an alternate expansion of the "process pair 
of blocks" step 608 in Figure 6 that can be used in place 
of the versions shown in Figures 7 and 8. 

Figure 10 is an alternate expansion of the "perform wear 
leveling" step 208 in Figure 2 that can be used in place 
of the version shown in Figure 3. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

A technique for leveling the wear of a non-volatile 
integrated circuit device will increase the life of the 
5 device by monitoring storage patterns of the device and 
relocating certain data to other storage locations based 
on the activity level of the data and the cumulative wear 
of the storage locations. The* technique works 
particularly well for EEPROM devices such as flash 

10 memory, but the techniques can also be used with other 
types of non-volatile storage devices such as Ovionics 
Unified Memory (OUM) and Magnetoresistive RAM (MRAM) . As 
used herein, wear- leveling or a wear- leveling event is 
the process of moving one or more logical blocks from one 

15 physical location to another to more evenly distribute 
the wear of a non-volatile storage device. 

Flash memory comes in many different sizes such as 
64 Mbit, 128 Mbit, 256 Mbit, 512 Mbit and 1Gbit. 
Technological advances will soon result in even higher 

2 0 storage densities for flash memory. For purposes of the 
preferred embodiment described herein, a representative 
flash memory device having a capacity of 12 8 Mbit and a 
block size of 128 Kbyte will be shown. However, devices 
having other device geometries such as different 

25 capacities and block sizes can also take advantage of the 
wear level techniques described herein, and are well 
.within the scope of the present invention. 

One key aspect of the present invention is the use 
of sorted lists or vectors to keep track of memory 

30 storage usage patterns. These vectors facilitate the 
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identification of promising opportunities for wear 
leveling. Logical blocks that are infrequently 
erased/written can be moved into physical blocks that 
have received a lot of wear, while logical blocks that 
5 are frequently erased/written can be moved into physical 
blocks that have received little wear, thereby leveling 
the future wear of the individual physical blocks within 
the flash memory device. 

Referring now to Figure 1, there is shown a 

10 representative internal architecture of a flash memory 

device 100. In this particular example, the flash memory 
device has a total storage capacity of 128 Mbit organized 
as one hundred and twenty eight 12 8 -Kbyte physical blocks 
(shown by representative physical blocks 102, 104, 106, 

15 108 and 110) . Physical block 0 (shown at 102) has an 

address range of x'000000' - x'OlFFFF', physical block 1 
(shown at 104) has an address range of x' 020000' - 
x'03FFFF', physical block 31 (shown at 106) has an 
address range of x'3E0000' - x'3FFFFF', physical block 63 

20 (shown at 108) has an address range of x'7E0000' - 

x'7FFFFF', and physical block 127 (shown at 110) has an 
address range of x'FEOOOO' - x'FFFFFF'. The other 
physical blocks are not shown for ease of clarity in 
understanding the organization of the flash memory 

25 device. The physical blocks are organized in a byte-wide 
(x8) fashion in this example. The present invention 
keeps track of data access at the block level in the 
preferred embodiment . 

Also shown in Figure 1 is memory controller 120, 

3 0 which contains a CPU or processor 122 coupled to 
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controller memory 124 via bus 127. A non-volatile memory 
cache 12 6 also coupled to bus 127 is used as a temporary 
storage area in the wear-leveling technique described 
below. Memory cache 12 6 can also be used for receiving 
5 updates to the flash memory 100, which allows multiple 
updates to the flash to be captured during interim 
processing without actually updating the flash until the 
overall process has completed. This mitigates additional 
wear to the flash memory that would otherwise occur as 

10 part of the wear-leveling technique. Bridge/driver 
circuit 128 is used to connect internal bus 127 to 
external bus 129, where one or more flash memory devices 
100 can be attached and controlled by controller 120. 

While a single flash memory device 100 is shown in 

15 detail in this figure for simplicity, numerous other 
flash devices such as 130 can be included for systems 
requiring more memory, such as when used in conjunction 
with or in lieu of a disk array or tape library system. 
Memory controller 120 is typically installed as part of a 

2 0 larger computer system (not shown) such as a server or 
other type of data processing system. The techniques 
described herein could also be implemented in a hardware 
macro or similar circuitry that is included as part of 
the flash memory device itself, thus providing 

25 improvements in flash memory lifetime for both high and 
low-end applications. The block wear-leveling technique 
for flash memory devices will now be described. 

Referring now to Figure 2, there is shown a top- 
level flow chart of the present wear-leveling technique 

30 200. This process is executed on memory controller 120 
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of Figure 1 in the preferred embodiment, but could also 
be run in other parts of the overall system. The wear 
level technique starts at 202, and proceeds to step 204, 
where variables n and nO are initialized to a zero (0) 
5 value. As used herein, the variable n maintains a count 
of the total number times a given physical block has been 
erased. The variable nO is used to store the value of n 
for a given physical block at the time the last wear- 
leveling event occurred, as described below in connection 

10 with step 210. There is a separate n variable and nO 

variable for each physical block. For this example, with 
128 physical blocks, there will be 128 n variables and 
128 nO variables. Also shown in step 204 is the 
incrementing of variable n by one each time its 

15 associated physical block is erased. In step 206, a 

couple of other variables are used. a is a number much 
less than one, and a value in the neighborhood of 0.01 
has been determined to be effective for the present 
invention. L is the endurance limit for a given physical 

20 block, or the maximum number of erase cycles that the 
flash device is specified to handle by the device 
manufacturer. For example, many of today's flash devices 
are spec'ed at 100,000 or 1,000,000 erase/write cycles. 
Exceeding this number is not recommended by the 

25 manufacturer in order to ensure reliable operation, a 
multiplied by L (shown in Figure 2, step 206 as ah) is 
thus a threshold value used to determine if a wear 
leveling operation needs to be performed. So, in step 
206, a determination is made on whether the total erase 

3 0 cycles for any physical block minus the number of erase 
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cycles since the last wear leveling operation was 
performed (n - nO) exceeds the wear level threshold. For 
example, assume a is 0,01 and L is 100,000, a 
determination is made at step 206 whether any physical 
5 block has been erased more than 1,000 times (0.01 times 
100,000) since the last wear leveling event. If so, a 
wear leveling operation is performed at step 208, as will 
be described in more detail below. After a wear leveling 
operation has been performed at step 208, the variable nO 

10 for each physical block is set to the current value n for 
that physical block in order to save the value of n at 
the time of this wear-leveling event. This completes the 
wear-leveling operation, and the next wear-leveling 
operation is not initiated until some physical block 

15 experiences another 1,000 erasures (1,000 being the value 
of aL in our example) , as indicated by the test in step 
206. 

It is to be noted that the test in step 206 does not 
refer to the unevenness in the wear distribution, but 

2 0 rather to the largest amount of wear experienced by any 

physical block in the interval since the most recent 
wear-leveling operation. While the present invention is 
concerned primarily with the mechanism of the wear- 
leveling operation itself, this method for deciding when 
25 a wear-leveling operation should be initiated illustrates 
an effective approach, and an approach that guarantees 
that a significant amount of wear (in fact, any desired 
level of wear) occurs between successive wear- leveling 
operations. While this approach can introduce 

3 0 unnecessary wear in environments where unevenness in the 
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wear distribution is not a problem, this overhead wear is 
small owing to the smallness of the parameter a. 

A wear-leveling operation (generally shown at step 
208 in Figure 2) proceeds as shown at 300 in Figure 3. 
5 The first step is to compute the number of times each 
physical block has been erased since the previous wear- 
leveling event. This number is called An and is equal to 
n - nO, as indicated at 304. For each physical block, we 
therefore know its cumulative wear since the device began 

10 operation (n) and its wear since the previous wear- 
leveling event (An) . 

A block descriptor is maintained in the memory 
controller's memory (element 124 of Figure 1) for each 
physical block, and is used to store the variables n, nO, 

15 and An associated with a given physical block (such as 

physical block 102 in Figure 1) . A representative table 
of block descriptors is shown at 22 0 in Figure 4, and 
comprised a plurality of block descriptors 222, one for 
each physical block of flash memory. Each block 

2 0 descriptor maintains values for n, nO and An. For 

example, item 224 is the n value for physical block 0, 
item 226 is the nO value for physical block 0, and item 
228 is the An value for physical block 0. Of course, 
since An is determined using n and nO, in an alternate 

2 5 embodiment An is not separately maintained in the block 

descriptor table but rather dynamically computed using n 
and nO when needed. 

In addition to the values of n, nO , and An, a block 
descriptor may contain other information. One piece of 

3 0 additional information that is needed in some 



17 

Docket NO.2003-080-SFT 

implementations of the present invention is the block ID 
of the logical block that is stored in the physical 
block. 

In order to accommodate power failures to memory 
5 controller 120, the block descriptors could be stored in 
the non-volatile memory cache 126 instead of the volatile 
memory 124. However, this would have the undesirable 
effect of reducing the amount of non-volatile memory that 
is available for use as a write cache. For this reason, 

10 it is preferable to store the block descriptors in the 
volatile memory 124, but to protect against power 
failures in the following way. Each physical block 
contains or includes, in addition to the user data (the 
logical block) , a header where metadata is stored. The 

15 value of n for the physical block is stored in this 
header. This procedure requires a few bytes of each 
physical block to store the value of n, of course, but it 
introduces no wear overhead because the value of n can be 
incremented at the same time this physical block is re- 

2 0 written. The value of n for each physical block is 
therefore always available in the non-volatile flash 
memory 100 (Figure 1) . If a power failure should occur, 
we do lose the values of nO and An, but this is a 
relatively minor loss. We simply set nO for each 

25 physical block to its current value of n, so it is as if 
a wear-leveling event has just completed; the value of An 
is calculated as usual at the time of the next wear- 
leveling event. Thus, the impact of a power failure with 
this protection scheme is no more serious than skipping a 

30 single wear-leveling event. 
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The value of n for a physical block should be . 
incremented and (in the preferred embodiment) stored in 
the block header each time the physical block is re- 
written. In the following description of the wear- 
leveling mechanism, it will be assumed that this is done 
not only at the time of re-writes initiated by the user 
of the data but also at the time of re-writes that are 
part of the wear-leveling process itself. 

Once An has been computed for each physical block at 
the beginning of a wear-leveling event (step 304 in 
Figure 3) , each block descriptor indicates the wear that 
has been experienced by the given physical block since 
the beginning of device operation (n) and the wear that 
has been experienced by the given physical block since 
the previous wear-leveling event (An) . Since logical 
blocks are moved only during wear- leveling operations in 
the preferred embodiment, An also characterizes the 
activity of the logical block that is currently stored in 
the physical block; this is the real significance of An 
for the wear-leveling operation. The assumption is that 
the value of An based on the activity of the logical 
block in the recent past (during the interval since the 
most recent wear-leveling event) will provide useful 
information, at least in a statistical sense, on the 
activity of that logical block in the future. 

In general, wear leveling is achieved by moving 
relatively inactive logical blocks into physical blocks 
that have experienced the most wear to date and by moving 
relatively active logical blocks into physical blocks 
that have experienced comparatively little wear. The 
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information required for these operations is contained in 
the block descriptors once the values of An have been 
computed in step 304. This information is accessed most 
conveniently via pointers to the block descriptors. An 
5 ordered list of all the pointers to the block descriptors 
(one pointer for each physical block) will be called a 
vector. A key feature of the invention is the use of two 
vectors. One vector, denoted by vec_n, is sorted in 
decreasing order of n; that is, the first elements of 

10 vec_n point to the descriptors of the physical blocks 
with the largest values of n, and the last elements of 
vec_n point to the descriptors of the physical blocks 
with the smallest values of n. The other vector, denoted 
by vec_An, is sorted in increasing order of An; that is, 

15 the first elements of vec_An point to the descriptors of 
the physical blocks with the smallest values of An, and 
the last elements of vec_An point to the descriptors of 
the physical blocks with the largest values of An. The 
creation of the vectors vecji and vec_An by the memory 

20 controller 120 is indicated at 306. 

In the present invention, the vectors vec__n and 
vec_An play a central role in the determination of which 
data -movement operations are to be performed for the 
purpose of wear leveling (a data-movement operation being 

25 the relocation of a logical block from one physical block 
to another) . The instruction "process vectors" at step 
308 refers to the selection and execution of these data- 
movement operations under the guidance and control of the 
two vectors, as will be further described below. The 

3 0 wear-leveling process then ends at 310. 
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There is more than one way that the vectors can be 
processed at 308. One way is shown at 500 in Figure 5 and 
another at 600 in Figure 6; the latter way has its own 
variations because there is more than one way to expand 
5 step 608. These alternative implementations will now be 
described, beginning with the vector processing in Figure 
5. 

The algorithm shown in Figure 5 is based on the 
observation that the vectors vec_n and vec_An would be 

10 identical if the assignment of logical blocks to physical 
blocks were ideal from the standpoint of wear leveling. 
That is, the least active logical block (smallest value 
of An) would be stored in the physical block that has 
experienced the most wear to date (largest value of n) , 

15 the second least active logical block would be stored in 
the physical block with the second most wear, and so 
forth. This means that optimum wear leveling can be 
achieved as follows: for each physical block, find its 
location in vec_An (that is, find the element of vec_An 

2 0 that points to the block descriptor of the given physical 
block) and move the logical block that is currently 
stored in this physical block to the physical block at 
the corresponding location in vec_n (that is, to the 
physical block whose descriptor is referenced by the 

25 corresponding element of vec_n) . The algorithm in Figure 
5 is a practical way to perform these data movements 
without losing data and without using more than a minimal 
amount of non-volatile RAM. 

The algorithm is entered at 502, and begins with the 

30 construction of a transfer table at 504. One column of 
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the transfer table lists the physical blocks in the order 
in which they appear in vec_n (meaning, of course, the 
order in which their block descriptors are referenced by 
elements of vec_n) , and the other column lists the 
5 physical blocks in the order in which they appear in 
vec_An. Thus, each row contains the identifiers of two 
physical blocks from the same locations in vec_n and 
vec_An. Any rows in which these two identifiers are the 
same are deleted from the table. Each remaining row 

10 identifies a transfer that is to be performed, that is, a 
relocation of a logical block from the physical block 
referenced in vec__An to the physical block referenced in 
vec_n. For each such transfer, the physical block 
referenced in vec_An will be called the source, and the 

15 physical block referenced in vec_n will be called the 

sink. When a transfer is performed, the associated row 
in the transfer table must be rendered inoperative, 
either by setting a flag or simply by deleting the row 
(the first alternative would require an additional column 

20 in the transfer table to hold the inoperative flag, but 
this column will not be mentioned again; in particular, 
this flag should not be confused with the "third column' 7 
mentioned below, which is used to hold a different flag) . 
In Figure 5 "sink" and "source" are also used in the 

25 sense of variables that keep track of the sink and source 
for the current transfer. At 508 the sink variable is 
assigned to the first sink in the transfer table that has 
not yet received a new logical block. If a row 
specifying a transfer is deleted upon completion of the 

30 transfer, the assignment of the sink variable at 508 
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simply refers to the first remaining sink in the transfer 
table. The second step in 508 is to move the contents of 
the current sink to a temporary buffer in the non- 
volatile memory cache 126, which is called NVRAM in 508. 
5 The last step in 508 is to move the logical block in the 
current source to the sink. 

The transfer table should include a third column to 
hold a flag for each row that indicates whether the 
logical block originally assigned to the source block in 

10 that row has been moved to the NVRAM. At any given time, 
at most one of these flags will indicate that the logical 
block is in the NVRAM. When a logical block in a sink is 
moved to the NVRAM in 508, the physical block currently 
serving as the sink must be located in the source column, 

15 so that the flag can be set in the appropriate row. 

At the conclusion of 508, a transfer has been 
completed leaving a new logical block in the sink and no 
logical block left in the source. The next step is to 
move the appropriate logical block into this source. At 

2 0 510, the physical block that was the source in the 
transfer just completed is now the sink of the next 
transfer. It is therefore necessary to locate the 
identifier of this physical block in the sink column of 
the transfer table. The physical block in the same row 

2 5 of the source column is the new source. The logical 

block in this source is then moved to the sink at the end 
of 510. Of course, the logical block must be obtained 
from the NVRAM if the flag in the third column of the 
transfer table has been set. 
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What happens next depends on whether the logical 
block just transferred was retrieved from the NVRAM, as 
indicated by the test at 512. If the logical block just 
transferred was obtained from a physical block, that 
5 physical block is now empty and must receive its new 
logical block from the appropriate source; this is 
accomplished by the return to 510 when the test at 512 
yields a NO result. On the other hand, if the logical 
block just transferred was retrieved from the NVRAM, all 

10 physical blocks that have participated in the transfers 
up to this point contain the right logical blocks for 
wear leveling, and the wear-leveling process for the 
remaining physical blocks begins with a return to 506. 
Eventually, it will be found upon one of these returns to 

15 506 that the transfer table is empty (all transfers have 
been performed) , in which case the test at 506 yields a 
YES result and the processing terminates at 514. 

The algorithm shown in Figure 5 generates the best 
possible assignment of logical blocks to physical blocks 

2 0 for the purpose of wear leveling. However, the number of 
transfers required to achieve this optimal assignment may 
be excessive; in particular, these overhead transfers may 
add significantly to the wear on the device. Of course, 
the wear- leveling event could be terminated at any time 

25 that the test at 512 yields a YES result, but there can 
be no guarantee that the transfers performed up to that 
point will be sufficient (e.g., the second most worn 
block could still contain an active logical block) . 
These considerations motivate the alternative 

30 approach that is shown at 600 in Figure 6. Like the 



24 

Docket NO.2003-080-SFT 

algorithm in Figure 5, the algorithm in Figure 6 is a 
method for performing the vector processing that is 
called for at step 308 in Figure 3. This new method does 
not achieve the best possible assignment of logical 
5 blocks to physical blocks, but it offers the ability to 
terminate processing when an effective degree of wear 
leveling has been achieved with an acceptable amount of 
overhead wear. 

In the method shown in Figure 6, processing starts 

10 at the beginning of the vectors vec__n and vec_An and 
proceeds systematically toward the end of the two 
vectors. The position in the two vectors at any given 
time is indicated by pointers into the vectors: a pointer 
named Pn points to an element of vec_n, and a pointer 

15 called PAn points to an element of vec_An. For each 
combination of Pn and PAn that is encountered in the 
course of the processing, the block descriptors that are 
referenced by the associated elements of vec_n and vec_An 
are compared. The contents of the two referenced 

2 0 physical blocks (i.e., the logical blocks that are stored 
in the two physical blocks that are described by the two 
block descriptors being compared) are exchanged 
("swapped") when the comparison indicates that this 
exchange will be conducive to wear leveling. After the 

25 comparison of the two block descriptors (and the swap, if 
one was performed) , at least one of the pointers Pn and 
PAn is reset ( "incremented" ) to point to the next element 
of its associated vector (or beyond its associated vector 
if it was pointing to the last element) , and then the new 
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pair of block descriptors is processed (unless a test 
indicates that processing is to be terminated) . 

Thus, after the algorithm at 600 has been entered at 
602, Pn and PAn are set to point to the first elements of 
5 their respective vectors at 604. At 606 a test is 

performed to determine whether processing is to continue 
or whether it is to be terminated. In the particular 
implementation shown in Figure 6, the test shown at 606 
determines whether Pn and PAn both point to elements of 

10 their respective vectors. A NO result for this test 
means that the last element of at least one of the 
vectors has been processed (and the subsequent 
incrementing of the associated pointer has caused it to 
point beyond the end of the vector) , and so processing 

15 must terminate. An implementation designed to avoid 

excessive wear overhead can use a different test at 6 06 
in order to terminate processing at an earlier stage. 
This will be discussed after the current approach to wear 
leveling has been explained in greater depth. 

20 If the test at 606 yields a YES result, indicating 

that processing should continue, the block descriptors 
that are referenced indirectly by Pn and PAn (recall that 
each of these pointers is a pointer to a vector element 
which is a pointer to a block descriptor) are compared, a 

25 swap of logical blocks is performed if the comparison 

indicates that this would promote wear leveling, and at 
least one of the pointers Pn and PAn is reset to point to 
the next element of its associated vector. All this is 
included in step 608, the details of which are described 

3 0 below. The test at 606 is then repeated, with a YES 
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result leading to the processing of another pair of block 
descriptors at 608 and a NO result leading to some final 
processing at 610 (explained below) followed by 
termination of processing at 612. 
5 At this point it is necessary to clarify what is 

meant by a swap of logical blocks . There are two types 
of swaps to be considered: logical and physical. In a 
physical swap, the logical blocks stored in two physical 
blocks are actually exchanged. This is done by copying 

10 both logical blocks into controller memory (preferably 

the non-volatile memory 12 6 in Figure 1) and then writing 
these logical blocks to their new locations in the flash 
memory 100 (each logical block being written to the 
physical block formerly occupied by the other logical 

15 block) . 

In a logical swap, the exchange of the two logical 
blocks does not actually occur, but a record is kept of 
the data movements involved in the swap. This is an 
important technique because a given logical block may be 

20 involved in a number of swaps during the execution of 

600. Instead of moving the same logical block multiple 
times, it is better to keep track of the current 
destination of each logical block, to update this 
destination whenever the given logical block is involved 

25 in another logical swap, and to actually move the logical 
block only once, when its final destination is known. 

The use of logical swaps during the execution of 600 
will now be described in greater detail. The necessary 
record keeping is done in a logical-block table with 

3 0 three entries for each logical block that is to be moved: 
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the block ID of the logical block, the current location 
of the logical block (i.e., the physical block where the 
logical block is stored) , and the future location of the 
logical block. Note that this method requires the block 
5 descriptor to include the block ID of the logical block 
that is stored in the physical block. 

When a comparison of two block descriptors at 608 
indicates that swapping the contents of the two physical 
blocks would be conducive to wear leveling, two rows in 

10 the logical -block table must be created or modified, one 
row for each logical block involved in the swap. If a 
logical block to be moved does not yet appear in the 
table (that is, if the block ID of this logical block 
does not yet appear in the table) , a new row is created 

15 listing the block ID of the logical block, the physical 
block where it is currently stored, and the physical 
block to which it is to be moved during the swap. If the 
logical block to be moved already appears in the table, 
only its future location is modified. 

2 0 Thus, the logical -block table is built during the 

executions of step 608 that discover opportunities for 
wear-leveling swaps. When the test at 606 indicates that 
the processing of the vectors is to be terminated, this 
logical-block table is just like the transfer table of 

2 5 Figure 5 except that it has an additional column (the 

column containing the block IDs of the logical blocks 
that are to be moved) . This additional column serves no 
further purpose and can henceforth be ignored. The two 
remaining columns of the logical -block table constitute 

3 0 the transfer table for this implementation of the 
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invention. To move each logical block to its destination 
(the resultant of all the logical swaps in which the 
logical block was involved) , this transfer table is 
processed at step 610 according to the algorithm given in 
5 Figure 5 (starting at 506 and ending at 514) . 

If each swap that is found to be conducive to wear 
leveling at step 608 is executed physically, no transfer 
table is constructed and step 610 is skipped. To avoid 
unnecessary wear, it is preferable to treat each swap at 

10 step 608 as a logical swap and to process the resulting 
transfer table at step 610. With this understood, it will 
not be necessary to distinguish between physical and 
logical swaps in the following explanations. 

The details of the "process pair of blocks" step 608 

15 are shown at 700 in Figure 7. This processing involves a 
block descriptor that is referenced by an element of 
vec_n (specifically, the element of vec_n to which Pn 
currently points) and a block descriptor that is 
referenced by an element of vec_An (specifically, the 

20 element of vec_An to which PAn currently points) . To 
compare the values of n and An in these two block 
descriptors, the quantities 8n and 8An are defined: 5n is 
the difference between the two values of n for each 
respective block of the pair of blocks, and 5An is the 

25 difference between the two values of An for each 

respective block of the pair of blocks. Specifically, 8n 
is the value of n in the block descriptor that is 
referenced by the element of vec_n to which Pn points 
minus the value of n in the block descriptor that is 

3 0 referenced by the element of vec_An to which PAn points; 
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similarly, SAn is the value of An in the block descriptor 
that is referenced by the element of vec_n to which Pn 
points minus the value of An in the block descriptor that 
is referenced by the element of vec_An to which PAn 
5 points. 

The first step after process 700 is entered at 702 
is the test at 704, which determines whether the product 
of 8n and 8An is positive. If this product is positive, 
either 8n and 8An are both positive or they are both 

10 negative. In either case, the physical block with the 
larger value of n is also the physical block with the 
larger value of An; that is, the more worn physical block 
is also the physical block that contains the more active 
logical block, and wear leveling can be accomplished by 

15 swapping the contents of the two blocks. This swapping 
is shown at 706, which is executed if the product of 8n 
and 8An is positive. 

It is to be noted that the logic of 704 and 706 
would remain valid for any ordering of the two vectors. 

2 0 In fact, the vector concept is not even necessary if one 
would be satisfied with an inefficient method for wear 
leveling: one could pick pairs of block descriptors at 
random, perform the test indicated at 704, and swap the 
contents of the two physical blocks if the test at 704 

25 yielded a YES result. 

It is the ordering of the elements of the vectors 
vec_n and vec_An that makes efficient wear leveling 
possible. The first elements of the two vectors will 
illustrate this point. The first element of vec_n points 

30 to the block descriptor with the largest value of n, 
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while the first element of vec_An points to the block 
descriptor with the smallest value of An. It is 
therefore guaranteed that the most worn physical block 
will receive the least active logical block as a result 
5 of the first execution of 608 during each wear-leveling 
event. The only exceptions occur when the two first 
block descriptors being compared have the same values of 
n and/or the same values of An, in which case swapping is 
pointless and would not occur (since the product of 8n 

10 and 5An would not be positive) . 

To continue the explanation of step 706, it is noted 
that when the contents of two physical blocks are 
swapped, the values of An in their respective block 
descriptors should also be swapped since An measures the 

15 activity of the logical blocks stored in the physical 
blocks. This is necessary because each physical block 
(or rather a pointer to its block descriptor) appears in 
both vec_n and vec_An. A physical block that receives a 
new logical block in a swap may therefore be encountered 

20 in a later execution of 700; the activity level of its 

current contents should be characterized with the correct 
value of An so that 5An will be given its correct value 
at this later execution of 700. 

When the contents of two physical blocks have been 

25 swapped (logically or physically) , and the values of An 
in their respective block descriptors have also been 
swapped (regardless of whether the logical blocks were 
swapped logically or physically) , the processing of the 
pair of block descriptors is complete. The pointers Pn 

3 0 and PAn are then modified to prepare for the processing 
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of the next pair of block descriptors. Specifically, Pn 
is incremented so that it points to the next element of 
vec_n (or beyond vec_n if Pn was pointing to the last 
element of vec_n) ; this is the meaning of ++Pn in 706. 
5 Similarly, PAn is incremented so that it points to the 
next element of vec_An (or beyond vec__An if PAn was 
pointing to the last element of vec_An) ; this is the 
meaning of ++PAn in 706. After 706, the logic flow moves 
to 708, which signals the conclusion of one execution of 

10 step 608 in Figure 6. 

So far one path through the "process pair of blocks" 
operation 700 has been described, namely, the path that 
discovers an opportunity for wear leveling and acts on 
this opportunity at 706 by swapping the contents of two 

15 physical blocks. Now the other paths through 700 will be 
described. 

When the test at 704 yields a NO result, swapping 
the contents of the two physical blocks currently being 
compared is not conducive to wear leveling. One way to 

2 0 handle this case would be to increment both vector 

pointers (++Pn and ++PAn, as in 706) , so that the next 
execution of 700 will involve an entirely new pair of 
physical blocks. However, it is preferable to increment 
only one of the vector pointers, since one of the 
25 physical blocks presently being considered may yet offer 
an opportunity for wear leveling when it is compared with 
a different block. The choice of which vector pointer to 
increment is based on an appraisal of which of the two 
current block descriptors is more likely to provide an 

3 0 opportunity for a wear- leveling swap when it is compared 
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with the next block descriptor in the other vector. The 
two possibilities will now be explained. 

Borderline cases in which either 5n or 5An is zero 
will not be considered. A NO result for the test at 704 
5 then implies that 5n and 5An have opposite signs. There 
are therefore two cases to be considered: (a) 8n is 
positive while 6An is negative (so that the test at 710 
yields a YES result and step 712 is executed) and (b) 5n 
is negative while 5An is positive (so that the test at 

10 710 yields a NO result and step 714 is executed) . The 
latter case will be explained first. 

Let PB_n denote the physical block whose descriptor 
is referenced by Pn, and let PB_An denote the physical 
block whose descriptor is referenced by PAn. If 8n is 

15 negative, PB_An is more worn than PB_n. Owing to the 

order in which vec_n is sorted, this will remain true if 
Pn is incremented to point to the next element of vec_n. 
In a wear- leveling swap, PB_An must therefore receive a 
logical block that is less active than the logical block 

20 it currently holds. However, it is known that PB_An 
already contains a logical block whose activity is a 
reasonable match for the amount of wear that PB_An has 
experienced. This is because PB_An has already been 
encountered in vec_n in an previous execution of 700 

25 (this follows from the order in which vec_n is sorted and 
the fact that 8n is negative) , and at that time the block 
that is now PB_An was compared with a physical block that 
was as close as possible to the beginning of vec_An (and 
therefore had a value of An that was as small as 
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possible) . For this reason, the pairing of PB_An with 
any subsequent physical block in vec_n is not likely to 
offer a good opportunity for a wear-leveling swap. The 
pointer PAn should therefore be incremented, and this is 
5 indicated at 714. 

In the other case to be considered, 5n is positive 
while 5An is negative: PB_n is more worn than PB_An, but 
the logical block stored in PB_n is less active than the 
logical block stored in PB_An . Swapping the contents of 

10 PB_n and PB_An will not promote wear leveling. Instead, 
step 712 indicates that the appropriate action is to skip 
PB_n by incrementing the pointer Pn. This decision is 
based on the judgment that a suitable swapping partner 
for PB_n is unlikely to be found in subsequent elements 

15 of vec_An. The basis for this judgment will now be 
explained. 

First, it will not be possible to find a less active 
logical block for PB_n: the order in which vec_An is 
sorted shows that 5An will remain negative if PAn is 
2 0 incremented. An exception to this rule occurs when a 
value of An has been modified by a swap that occurred 
earlier in the wear-leveling event. However, a less- 
active logical block that is encountered in this way in 
vec__An will be stored in a block with a higher value of n 

2 5 (since the swap that caused the change of An occurred 

when Pn pointed to a previous element of vec_n) , and so a 
swap would not be conducive to wear leveling in this 
case . 

Second, a wear- leveling swap in which PB__n receives 

3 0 a logical block that is more active than its current 
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logical block is also not a promising prospect. In a 
wear-leveling swap, such a logical block would have to 
come from a physical block that is more worn than PB_n, 
and this physical block would have to receive from PB_n a 
5 logical block that is less active than its current 

contents. But any block that is more worn than PB_n has 
been encountered in vec_n in a previous execution of 700, 
and has therefore already had a good opportunity to 
receive a logical block with the least possible activity. 

10 This concludes the explanation of why the block PB_n 

is skipped (by incrementing the pointer Pn) at 712, which 
is executed when 5n is positive and 5An is negative. 

The vector-pointer adjustments performed at steps 
706, 712, and 714 have now been explained. After the 

15 appropriate adjustment, the processing of the pair of 
blocks concludes at 708. If both vector pointers still 
point to elements within their respective vectors, the 
test at 606 in Figure 6 yields a YES result, and the 
resulting execution of step 608 will entail a return to 

20 702 in Figure 7 . 

The algorithm shown in Figure 7 can be summarized as 
follows. The logic leading up to 712 and 714 attempts to 
maximize the opportunities for wear-leveling swaps that 
are encountered as the vectors vec_n and vec_An are 

25 traversed during the execution of the loop controlled by 
the test at 606 in Figure 6. These opportunities are 
detected by a YES result for the test at 704, and the 
swaps are performed at 706. The sorting orders of the 
vectors vec_n and vec_An maximize the productivity of 

30 these swaps since physical blocks with large values of n 
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tend to be paired with blocks with low values of An (near 
the beginning of the vectors vec_n and vec_An, 
respectively) , while blocks with small values of n tend 
to be paired with blocks with high values of An (near the 
5 end of the two vectors) . 

Figures 3, 6, and 7 constitute one embodiment of the 
present invention, with each succeeding figure providing 
the details of a step in the figure that precedes it. In 
this particular embodiment, the only technique that is 

10 used to minimize overhead wear (wear on the device caused 
by the wear-leveling process itself) is the technique of 
logical swaps. Two different methods for further 
reducing overhead wear will now be described. The first 
method to be described is a modification of Figure 7, and 

15 the second is a modification of Figure 3. These two 

methods and the method of logical swaps can be employed 
individually or in any combination. 

Although the sorting orders of the vectors are 
intended to maximize the productivity of swaps, a YES 

20 result for the test at 704 in Figure 7 does not guarantee 
that the resulting swap at 706 will make a significant 
contribution to wear leveling. In particular, the 
benefit will be minimal if either 8n or 8An is small in 
absolute value. Such swaps of minimal benefit are likely 

25 to increase overhead wear without offering an off-setting 
benefit. These undesirable swaps can be eliminated with 
a suitable modification of the test at 704. In 
particular, the full test at 704 should yield a YES 
result only if 8n and 8An have the same sign (which is 

30 equivalent to 8n x 8An > 0, the original condition at 
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704) and the absolute values of both 8n and 5An are 
sufficiently large. For example, if Dn denotes the 
difference between the largest and smallest values of n 
in the population of physical blocks and if DAn denotes 
5 the difference between the largest and smallest values of 
An, the extension to the test at 704 might require that 

the absolute value of 5n exceed some specific fraction of 
Dn and the absolute value of 5An exceed some specific 
fraction of DAn. 

10 An alternative to 700 that avoids swaps of marginal 

value is shown at 800 in Figure 8. The test at 804 shows 
the modification to 704 just described, in which the 
absolute values of 8n and 5An must both meet appropriate 
minimum size requirements for a swap to be performed at 

15 806; the parameters f and g in 804 are numbers between 0 
and 1 whose precise values depend on the desired degree 
of filtering. If 8n and 8An do not have the same sign or 
if at least one of these quantities does not meet the 
minimum size requirement, the test at 804 yields a NO 

2 0 result, and one of the vector pointers is incremented at 
812, 814, 818, or 820. Steps 812 and 814 correspond 
exactly to steps 712 and 714, respectively, of Figure 7; 
that is, the incrementation of Pn at 812 occurs when 8n x 
8An < 0 and 8An < 0, which is precisely when Pn is 

25 incremented at 712, and the incrementation of PAn at 814 
occurs when 8n x SAn < 0 and 8An > 0, which is precisely 
when PAn is incremented at 714. However, steps 818 and 
820 are new; one of them is executed when a swap would be 
conducive to wear leveling (the NO result for the test at 



37 

Docket NO.2003-080-SFT 

808 means that 5n and 5An have the same sign) but is 
avoided because at least one of the size conditions 
imposed at 804 is not satisfied. Whether 818 or 820 is 
executed depends on the outcome of the test at 816, which 
5 will now be explained. 

First, it is to be noted that the test at 816 yields 

a YES result when 6n and §An are both positive but 5An is 

small. The requirement that 5An be positive is not 
mentioned explicitly at 816, but follows from the NO 
10 result for the test at 808 together with the requirement 
that 5n be positive. 

The rationale that was given above to justify step 
712 when 5n is positive but §An is negative also applies 

when 5n is positive and 5An is positive but small. It 
15 follows that Pn should be incremented (as it is at 712) 
when the test at 816 yields a YES result, and this is 
indicated at 818. 

It remains to be shown that PAn should be 
incremented when the test at 816 yields a NO result. The 
20 test at 816 yields a NO result in two cases: if 6n is 

negative (violating the first condition in 816) or if 5An 
is not small (violating the second condition in 816) . If 
8n is negative (the first case) , the rationale that was 
given above to justify step 714 when 5n is negative 
25 continues to apply, and so PAn should be incremented. If 
5n is positive but 5An is not small (the second case) , it 
follows from the NO results for the tests at 804 and 808 

that 8n must be small (that is, the middle condition at 
804 must be violated) . Owing to the order in which vec_n 
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is sorted, incrementing Pn can only decrease 5n. As long 
as 5n remains positive, no swap will be done owing to 

smallness of 5n. If 5n becomes negative, the rationale 
for 714 applies once again, and so PAn should be 
5 incremented. This concludes the rationale for step 820, 
which increments PAn when the test at 816 yields a NO 
result . 

As has already been explained, 800 may be preferred 
over 700 because it reduces overhead wear by eliminating 

10 swaps in which 5n or 5An is small in absolute value. It 
is to be noted that this can be accomplished in ways 
other than that shown at 800. In the method shown at 900 
in Figure 9, for example, the test at 704 (Figure 7) is 
left unchanged at 904, and the magnitudes of 5n and 5An 

15 are examined in a separate test at 906. This approach is 
equally effective as the one shown at 800 (Figure 8) in 
eliminating the swaps of marginal value. However, since 
both pointers are incremented at 916 when such swaps are 
encountered instead of just one pointer being incremented 

2 0 under these conditions at 818 or 820, the approach at 900 
may miss opportunities for productive swaps that are 
captured by the approach at 800. 

A different method for reducing overhead wear will 
now be explained. When the vectors vec_n and vec_An are 

2 5 processed by moving through them systematically from 

beginning to end, as shown in Figure 6, the higher-level 
organization of a wear- leveling event that is shown in 
Figure 3 can be modified to achieve an effective degree 
of wear leveling with less overhead wear (that is, with 

3 0 fewer movements of logical blocks) . Such a modification 
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is shown at 1000 in Figure 10. The concepts required for 
this modification will now be explained. 

The first concept needed to understand Figure 10 is 
the reversal of sorting orders. Up to this point it has 
5 been assumed that the vectors vec_n and vec_An are 
created by sorting the block-descriptor pointers in 
decreasing order of n and in increasing order of An, 
respectively. However, the method of this invention also 
works if the sorting order of each vector is reversed, 

10 since this transformation does not alter the essential 

correlation between the value of n in PB_n and the value 
of An in PB_An, namely, that a high value for one of 
these variables tends to be associated with a low value 
for the other. The only algorithmic change that is 

15 required in order to process the vectors sorted in this 
way is that each appearance of ++Pn in Figures 7, 8, and 
9 should be replaced with ++ PAn, and vice versa; that 
is, Pn should be incremented in 714, 814, 820, and 914 
while PAn should be incremented in 712, 812, 818, and 

20 912. 

The second concept needed to understand Figure 10 is 
iterative vector processing. The systematic traversal of 
the vectors that is described in Figure 6 (in conjunction 
with an appropriate expansion of step 608, such as 700, 

25 800, or 900) , will achieve a certain degree of wear 

leveling, but the resulting mapping of logical blocks to 
physical blocks will not generally be the best possible. 
Improved results can be achieved by iteration: after the 
vectors have been processed (but before the transfer 

30 table is processed in the case of logical swaps) , vec_An 
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is sorted again (since some physical blocks now have 
different values of An as a result of the swaps) and then 
the two vectors (vec_n and vec_An) are processed again. 
This may be repeated any number of times. If swaps have 
5 been performed logically, the transfer table is processed 
(that is, the logical blocks are physically moved) after 
the completion of the desired number of iterations. 

Iterative vector processing can be combined with the 
reversal of sorting orders. After a traversal of the 
10 vectors sorted in one order, both vec_n and vec_An are 
sorted in the reverse order before they are traversed 
again. 

The last concept needed to understand Figure 10 is 
truncated vector processing. According to the test shown 

15 at 606 in Figure 6, processing continues until one of the 
pointers Pn or PAn no longer points to an element of its 
associated vector. However, the test shown at 606 can be 
replaced with another test that will terminate vector 
processing at an earlier stage. A few possibilites are: 

20 a new test at 606 could yield a NO result when the number 
of times that step 608 has been executed reaches some 
predetermined value (perhaps specified as some fraction 
of the total number of physical blocks) ; or the new test 
could yield a NO result when the value of n for PB_n 

2 5 passes some predetermined percentage of the mean value of 
n for all the physical blocks; or the new test could 
yield a NO result when the number of rows in the logical - 
block table (and therefore the number of logical blocks 
that must be moved at step 610) reaches some 

30 predetermined value; and so on. Alternatively, the 
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vectors vec_n and vec_An could be generated with partial 
sorts (so that, in the sorting order assumed in most of 
this discussion, vec_n and vec_An contain only the 
physical blocks with the k largest values of n and the k 
5 smallest values of An, respectively, for some value of 
k) . In the latter approach, vector processing is 
truncated (that is, it is terminated before all the pairs 
of block descriptors have been processed) by building 
shortened versions of the vectors, without changing the 

10 test that is shown at 606. 

The concepts of reversal of sorting orders, 
iterative vector processing, and truncated vector 
processing can be employed individually and in various 
combinations. One combination of particular interest is 

15 motivated by the following observation: the wear-leveling 
operations of greatest importance involve either (a) the 
movement of the least -active logical blocks into the 
most -worn physical blocks or (b) the movement of the 
most-active logical blocks into the least-worn physical 

20 blocks. How the logical blocks of intermediate activity 
are distributed over the physical blocks that have 
experienced intermediate amounts of wear is of lesser 
importance as long as the difference in wear between the 
most worn and least worn blocks is sufficiently small. 

25 It follows that unnecessary data movement (and therefore 
unnecessary wear overhead) can be avoided if the middle 
portions of the vectors are not processed. 

One way to process both ends of the vectors without 
processing the middle is as follows: create the vectors 

3 0 vec_n and vec_An as originally defined (sorted in 
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decreasing order of n and increasing order of An, 
respectively) ; perform a truncated processing of this 
pair of vectors; re-sort vec_n and vec_An into reverse 
order (increasing order of n and decreasing order of An, 
5 respectively) ; and perform a truncated processing of this 
new pair of vectors. The truncated processing of the 
first pair of vectors moves the least-active logical 
blocks into the most -worn physical blocks; the truncated 
processing of the second pair of vectors moves the most- 

10 active logical blocks into the least -worn physical 

blocks. It is this combination of reversal of sorting 
orders, iterative vector processing, and truncated vector 
processing that is shown at 1000 in Figure 10. 

When this organization of the wear-leveling event is 

15 used (instead of that shown in Figure 3) , the wear- 
leveling routine is entered at 1002, and the value of An 
for each physical block is computed at 1004 and recorded 
in the block descriptor. Step 1006 initializes to zero a 
variable i that is used to keep track of the number of 

2 0 iterations that have been performed. The test at 1008 

compares the current value of i with the total number of 
iterations to be performed. Since the value of i is zero 
the first time test 1008 is encountered, this test yields 
a YES result, and the value of i is tested again at 1010. 

25 The test at 1010 determines whether the value of i is odd 
or even. Since i is even (zero) at this point, the test 
at 1010 yields a YES result, and so steps 1012 and 1016 
are executed. The notation "version A" in these steps 
refers to the vectors vec_n and vec_An in their original 

30 sorting orders. The version-A vectors are created at 1012 
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and processed at 1016. With one exception, the 
processing at 1016 is identical to that shown at 600 in 
Figure 6 (together with 700, 800, or 900) . The exception 
is that the transfer table is not processed at this time, 
5 so that step 610 in Figure 6 is skipped. 

After the completion of step 1016, the value of i is 
incremented at 1020 and tested again at 1008. As the 
current value of i (1) is still less than 2, test 1008 
again yields a YES result, and the test at 1010 is 

10 encountered once more. This time i is odd, so test 1010 
yields a NO result, and so steps 1014 and 1018 are 
executed. The notation "version B" in these steps refers 
to the vectors vec_n and vec_An sorted in reverse order. 
The version-B vectors are created at 1014 (by re-sorting 

15 the vectors vec__n and vec_An) and processed at 1018. 

With two exceptions, the processing at 1018 is identical 
to that shown at 600 in Figure 6 (together with 700, 800, 
or 900) . One exception is the same as that noted for 
step 1016: the transfer table is not processed at this 

20 time (step 610 of Figure 6 is skipped) . The other 

difference is that ++Pn and ++PAn, wherever they appear 
in 700, 800, and 900, are interchanged to account for the 
reversed sorting orders of vec_n and vec_An. 

After the completion of step 1018, the value of i is 

25 incremented once more at 1020 and tested again at 1008. 
As the value of i has now reached 2, test 1008 yields a 
NO result, indicating that no more iterations are to be 
performed. The transfer table constructed during the 
vector processing is itself processed at 1022 (i.e., the 

30 logical blocks in the logical -block table are moved to 
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their destination physical blocks) , as previously 
described. The wear-leveling event then terminates at 
1024. 

Several remarks must be made in connection with 
5 Figure 10. The organization of a wear-leveling event 

that is shown in Figure 10 is motivated by the desire to 
process only the beginning and ending portions of the 
vectors vec_n and vec_An in order to minimize unnecessary 
wear associated with relatively unproductive relocations 

10 of logical blocks. Truncation of vector processing is 

therefore essential, but this truncation is not explicit 
in Figure 10. It is to be understood that either the 
vectors vec_n and vec_An are created at 1012 and 1014 in 
abbreviated form by partial sorts, or the test at 606 in 

15 Figure 6 is replaced with a test that will terminate 
processing at a suitable stage (before the middle 
portions of the vectors are reached) , as previously 
described. 

The wear-leveling event described in Figure 10 
20 involves two iterations of vector processing, with the 
vectors sorted in reverse order in the second iteration. 
The purpose of the two iterations is to process different 
ends of the vectors. But the concept of iterative vector 
processing was introduced as a method for improving wear- 
25 leveling results by repeated processing of the same 

vectors (or portions of vectors) . This method can be 
incorporated in Figure 10 simply by replacing the value 2 
in the test at 1008 with a larger number. For example, 
if the test at 1008 were w i < 4?" instead of "i < 2?", 
3 0 both ends of the vectors would be processed twice. 
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The iteration shown in Figure 10 is not the only way 
to process both ends of the vectors without processing 
the middle portions. This can also be accomplished using 
the simpler organization shown in Figure 3 provided 
5 suitable changes are made to the organization of vector 
processing that is shown in Figure 6. However, the 
iterative process shown in Figure 10 may be the simplest 
approach, and the re-sorting of the vectors between 
iterations may improve results since the re-sorting of 

10 vec_An takes into account changes in An due to logical 
swaps performed during the first iteration. 

Thus, a technique has been described for leveling 
the wear of a non-volatile integrated circuit device that 
will increase the life of the device by monitoring 

15 storage patterns of the device and relocating certain 
data to other storage locations based on the activity 
level of the data and the cumulative wear of the storage 
locations. Sorted lists or vectors are used to keep 
track of memory storage usage patterns. These 

20 lists/vectors facilitate the identification of promising 
opportunities for wear leveling. Logical blocks that are 
infrequently erased/written can be moved into physical 
blocks that have received a lot of wear, while logical 
blocks that are frequently erased/written can be moved 

25 into physical blocks that have received little wear, 
thereby leveling the future wear of the individual 
physical blocks within the flash memory device. 

It is important to note that while the present 
invention has been described in the context of a fully 

3 0 functioning data processing system, those of ordinary 
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skill in the art will appreciate that the processes of 
the present invention are capable of being distributed in 
the form of a computer readable medium of instructions 
and a variety of forms and that the present invention 
5 applies equally regardless of the particular type of 
signal bearing media actually used to carry out the 
distribution. Examples of computer readable media 
include recordable- type media such a floppy disc, a hard 
disk drive, a RAM, CD-ROMs, and transmission- type media 

10 such as digital and analog communications links. 

The description of the present invention has been 
presented for purposes of illustration and description, 
and is not intended to be exhaustive or limited to the 
invention in the form disclosed. Many modifications and 

15 variations will be apparent to those of ordinary skill in 
the art. The embodiment was chosen and described in 
order to best explain the principles of the invention, 
the practical application, and to enable others of 
ordinary skill in the art to understand the invention for 

2 0 various embodiments with various modifications as are 
suited to the particular use contemplated. 



